Akshaya: A Framework for Mining General Knowledge Semantics From Unstructured Text
Abstract
We present Akshaya, a tool that implements a framework for mining four types of “general knowledge semantics” (analytical semantics) from unstructured text: semantic siblings, topical anchors, topic expansion, and topical markers. The framework is extensible, so further general-knowledge semantic mining algorithms can be embedded in it. Unstructured text corpora are represented as a term co-occurrence graph, and semantic relations between terms are mined from that graph using graph algorithms such as random walks and graph clustering. The tool currently reads plain text documents, generates a term co-occurrence graph from them, and performs semantic association mining on it.
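To make the pipeline in the abstract concrete, here is a minimal sketch of building a term co-occurrence graph from plain text and running a random walk over it. It is not Akshaya's implementation: the function names, sentence-level co-occurrence, the regex tokenizer, the restart probability, and the step count are all assumptions chosen for illustration, and no stop-word filtering is applied.

```python
import random
import re
from collections import Counter, defaultdict
from itertools import combinations


def build_cooccurrence_graph(documents):
    """Map each term to a Counter of neighbours.

    Assumption: two terms co-occur when they appear in the same sentence;
    the edge weight is the number of sentences they share.
    """
    graph = defaultdict(Counter)
    for doc in documents:
        for sentence in re.split(r"[.!?]+", doc):
            # Naive tokenizer (assumption): lowercase alphabetic runs only.
            terms = sorted(set(re.findall(r"[a-z]+", sentence.lower())))
            for a, b in combinations(terms, 2):
                graph[a][b] += 1
                graph[b][a] += 1
    return graph


def random_walk_scores(graph, start, steps=10000, restart=0.15, seed=0):
    """Count visits of a weighted random walk with restart from `start`.

    Terms visited more often are more strongly associated with `start`
    under this (hypothetical) scoring scheme.
    """
    rng = random.Random(seed)
    visits = Counter()
    current = start
    for _ in range(steps):
        neighbours = graph[current]
        # Jump back to the start term on a dead end or with probability `restart`.
        if not neighbours or rng.random() < restart:
            current = start
            continue
        terms = list(neighbours)
        weights = [neighbours[t] for t in terms]
        current = rng.choices(terms, weights=weights, k=1)[0]
        visits[current] += 1
    visits.pop(start, None)  # the start term itself is not a result
    return visits
```

Calling `random_walk_scores(graph, "cricket").most_common(5)` would list the terms most often reached from "cricket". A real system would likely filter stop words and normalise terms before building the graph.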
Similar resources
Sentiment Analysis Meets Semantic Analysis: Constructing Insight Knowledge Bases
Numerous Web 2.0 applications collect user opinions and other user-generated content in the form of product reviews, discussion boards, and blogs, which are often captured as unstructured data. Text mining techniques are important for analyzing users’ opinions (sentiment analysis) and identifying topics of interest (semantic analysis). However, little work has been carried out that combines se...
Text Mining: Promises and Challenges
Text mining, also known as knowledge discovery from text or document information mining, refers to the process of extracting interesting patterns from very large text corpora in order to discover knowledge. Text mining is an interdisciplinary field involving information retrieval, text understanding, information extraction, clustering, categorization, visualization, database technol...
Text Mining - Knowledge Extraction from Unstructured Textual Data
In the general context of Knowledge Discovery, specific techniques, called Text Mining techniques, are necessary to extract information from unstructured textual data. The extracted information can then be used for the classification of the content of large textual bases. In this paper, we present two examples of information that can be automatically extracted from text collections: probabilisti...
Ontology-Based Document Clustering with a Fuzzy Approach
Data mining, also known as knowledge discovery in databases, is the process of discovering unknown knowledge from a large amount of data. Text mining applies data mining techniques to extract knowledge from unstructured text. Text clustering, the unsupervised classification of similar documents into different groups, is one of the important techniques of text mining. The most important step...
A Framework for Designing Knowledge Map using Text-mining method
A knowledge map is a representation tool for visualizing knowledge sources and the relationships among knowledge artifacts, and is considered a core element of a knowledge management system. To construct a knowledge map, we need to identify which category each newly registered knowledge artifact is mapped into, called structured knowledge. However, in view of fully utilizing obtainable knowledge, it ...
Publication date: 2014